Uncovering Noun-Noun Compound Relations by Gamification
نویسندگان
چکیده
Can relations described by English nounnoun compounds be adequately captured by prepositions? We attempt to answer this question in a data-driven way, using gamification to annotate a set of about a thousand noun-noun compound examples. Annotators could make a choice out of five prepositions generated with the help of paraphrases found in the Google ngram corpus. We show that there is substantial agreement among the players of our linguistic annotation game, and that their answers differ in about 50% of raw frequency counts of the Google n-gram corpus. Prepositions can be used to describe the majority of the implicit relations present in noun-noun compounds, but not all relations are captured by natural prepositions and some compounds are not easy to paraphrase with the use of a preposition.
منابع مشابه
Standardised Evaluation of English Noun Compound Interpretation
We present a tagged corpus for English noun compound interpretation and describe the method used to generate them. In order to collect noun compounds, we extracted binary noun compounds (i.e. noun-noun pairs) by looking for sequences of two nouns in the POS tag data of the Wall Street Journal. We then manually filtered out all noun compounds which were incorrectly tagged or included proper noun...
متن کاملInterpreting Noun Compounds using Bootstrapping and Sense Collocation
This paper describes a bootstrapping method for automatically tagging noun compounds with their corresponding semantic relations. Our work takes advantage of the collocation of senses of the noun compound constituents and also word similarity. We exploit this to generate a set of noun compounds from a set of previously tagged noun compounds by replacing one constituent of each noun compound wit...
متن کاملA Taxonomy, Dataset, and Classifier for Automatic Noun Compound Interpretation
The automatic interpretation of noun-noun compounds is an important subproblem within many natural language processing applications and is an area of increasing interest. The problem is difficult, with disagreement regarding the number and nature of the relations, low inter-annotator agreement, and limited annotated data. In this paper, we present a novel taxonomy of relations that integrates p...
متن کاملUsing Verbs to Characterize Noun-Noun Relations
We present a novel, simple, unsupervised method for characterizing the semantic relations that hold between nouns in noun-noun compounds. The main idea is to discover predicates that make explicit the hidden relations between the nouns. This is accomplished by writing Web search engine queries that restate the noun compound as a relative clause containing a wildcard character to be filled in wi...
متن کاملUsing Relations to Interpret Anaphora
In this paper we present a novel framework for resolving bridging anaphora. The new framework is based on the core set of relations that have been used to describe an entirely different linguistic process, the process of generating a compound noun from two different nouns. We argue that the linguistic processes of compound noun generation and the use of NP anaphora are alike hence have to use t...
متن کامل